Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

userguide: explain rule types and categorization - v9 #12209

Closed

Conversation

jufajardini
Copy link
Contributor

Previous PR: #12184

Link to ticket: https://redmine.openinfosecfoundation.org/issues/
https://redmine.openinfosecfoundation.org/issues/7031

Please check the built version: https://suri-rtd-test.readthedocs.io/en/doc-sigtypes-et-properties-v9/rules/rule-types.html

Describe changes:

Many of the examples and conclusions documented here were derived from tests and checks as seen on OISF/suricata-verify#2153

Add documentation about the rule types introduced by 2696fda.

Add doc tags around code definitions that are referenced in the docs.

Task #https://redmine.openinfosecfoundation.org/issues/7031
@jufajardini jufajardini added the typo/doc update No code change : only doc or typo fixes label Dec 3, 2024
Copy link

codecov bot commented Dec 3, 2024

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 83.18%. Comparing base (e9173f3) to head (7c79e48).
Report is 103 commits behind head on master.

Additional details and impacted files
@@            Coverage Diff             @@
##           master   #12209      +/-   ##
==========================================
+ Coverage   83.17%   83.18%   +0.01%     
==========================================
  Files         912      912              
  Lines      257111   257111              
==========================================
+ Hits       213856   213885      +29     
+ Misses      43255    43226      -29     
Flag Coverage Δ
fuzzcorpus 61.01% <ø> (-0.01%) ⬇️
livemode 19.41% <ø> (-0.01%) ⬇️
pcap 44.40% <ø> (+0.01%) ⬆️
suricata-verify 62.77% <ø> (-0.01%) ⬇️
unittests 59.17% <ø> (-0.01%) ⬇️

Flags with carried forward coverage won't be shown. Click here to find out more.

Comment on lines +95 to +104
* - IP Only
- Flow
- Once per direction
- On IP addresses on the flow
- Source/ Destination field of a rule
* - IP Only (contains negated address)(*)
- Flow
- Once per direction
- On IP addresses on the flow
- Source/ Destination field of a rule, containing negated address
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Action is flow if flow exists. Packets that are not part of a flow, .... will always be inspected per packet.

- Source/ Destination field of a rule
* - IP Only (contains negated address)(*)
- Flow
- Once per direction
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

All packets.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Test this with rule profiling to confirm.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Maybe we can have a SV test with like-ip-only

  • alert rule: several alerts
  • drop/pass: should be applied to the flow

- Packet
- Per-packet basis
- Packet-level info (e.g.: header info)
- ``itype``, ``tcp.hdr``, ``tcp.seq``, ``ttl`` etc.
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

tcp-pkt - add as keyword example here

- Flow, if stateful (**)
- Per stream chunk, if stateful, per-packet if not.
- Against the reassembled stream. If stream unavailable, match per-packet.
(stream payload or packet payload)
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

move to Inspected, be stream payload AND packet payload

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

exposes the stream and/or payload data.

* - Type
- Action Scope
- Inspected
- Matches
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Data exposed?


* - Type
- Action Scope
- Inspected
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Inspection hook?

- Keyword Examples (non-exhaustive)
* - Decoder Events Only
- Packet
- Per-packet basis
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Per-broken/ invalid packet

* - Decoder Events Only
- Packet
- Per-packet basis
- Packets that are broken on an IP level
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Decoding events

- ``content`` with ``startswith`` or ``depth``
* - Stream
- Flow, if stateful (**)
- Per stream chunk, if stateful, per-packet if not.
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Stream chunks, or just packets (if not accepted by stream engine)

* - Stream
- Flow, if stateful (**)
- Per stream chunk, if stateful, per-packet if not.
- Against the reassembled stream. If stream unavailable, match per-packet.
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

exposes stream reassembled payload or packet payload data

Comment on lines +185 to +187
- Application Layer (`app_layer`)
- Protocol Detection Only (`pd_only`)
- Decoder Events Only (`de_only`)
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

the ` is wrong.

Comment on lines +189 to +191
The rule examples provided further cover some such cases, but the table below
lists those keywords with more details:

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

check detect-flowbits.c:632. some cases will make a packet rule become a stateful / app_tx signature.

@jufajardini
Copy link
Contributor Author

@njlavigne There will be a new version of this coming out soon to take into account a review done with Victor, but do feel free to add feedback, please, if you see anything weird or missing :)

aspects aforementioned.

.. list-table:: Suricata Rule Types
:widths: 10, 17, 18, 30, 25
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Review width for last 2 columns, especially.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

the rtd theme isn't properly rendering list-table widths properly - it's ignoring this, basically.

The best workaround I've found was using nested paragraphs, as it seems that the width is being based on the column header, and this seems to be working.

@jasonish
Copy link
Member

jasonish commented Dec 3, 2024

Given the embedded code, etc. I wonder if this is a little too deep to from the rule chapter. Is it better suited at the end with some "rule internals" or part of a Suricata internals guide? At first I was wondering devguide, but I do realize there is some internals that should be documented with a power user audience, and not necessarily a developer audience.

@suricata-qa
Copy link

Information: QA ran without warnings.

Pipeline 23656

@jufajardini
Copy link
Contributor Author

Given the embedded code, etc. I wonder if this is a little too deep to from the rule chapter. Is it better suited at the end with some "rule internals" or part of a Suricata internals guide? At first I was wondering devguide, but I do realize there is some internals that should be documented with a power user audience, and not necessarily a developer audience.

We have Making sense out of alerts, right after the two Rules chapters. Maybe we could have this part about the rule types before or after that one? I understand it has more advanced stuff, but I think semantically it also makes sense that it remains close to the other chapters. imho, the code excerpts that we show aren't mandatory to understand the contents - more to add hooks for those who like to dig...

@jasonish
Copy link
Member

jasonish commented Dec 4, 2024

Given the embedded code, etc. I wonder if this is a little too deep to from the rule chapter. Is it better suited at the end with some "rule internals" or part of a Suricata internals guide? At first I was wondering devguide, but I do realize there is some internals that should be documented with a power user audience, and not necessarily a developer audience.

We have Making sense out of alerts, right after the two Rules chapters. Maybe we could have this part about the rule types before or after that one? I understand it has more advanced stuff, but I think semantically it also makes sense that it remains close to the other chapters. imho, the code excerpts that we show aren't mandatory to understand the contents - more to add hooks for those who like to dig...

Maybe move it to the end of the rules chapter? My thought are:

  • Making sense of alerts is more about reading the alert. Could be a different audience than rule writers. I think the large majority of our users are not rule writers, but read the alerts.
  • Reading documentation in order. If this new info it must have reading material before writing your first rule, keep it where it is, otherwise if its for the smaller few that really need to know whats going on, move it down

To be fair, advanced rule writing could be its own book I'm sure!

Comment on lines +199 to +203
- ``to_server``, ``to_client``
- no type change
* - ``flow``
- ``established``, ``not_established``
- to `packet`
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

ip_only and like_ip_only behave differently here.

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Are there cases where a "packet" type rule changes to a stream-type rule when a content keyword is added? What type is a rule that contains both content and flow:to_server?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

With much delay, thanks for the questions. I ended up taking a detour to better understand some corner cases that affect rules with flowbits, but now am back to this main work.

Given two packet rules:

#Packet rules

alert tcp any any -> any 443 (flow: to_server; flowbits:isset,tls_error; sid:1603; msg:"Allow TLS error handling (outgoing packet)"; )
alert tcp any any -> any 443 (flow: to_server; flowbits:set,tls_error; sid:1604; msg:"Allow TLS error handling (outgoing packet) - non-stateful rule"; )

Modifying them by adding flow:to_server and some simple content check, or just the content check will make them stream rules:

#Stream Rules

alert tcp any any -> any 443 (flow: to_server; flowbits:set,tls_error; content:"abc"; sid:1605; msg:"Allow TLS error handling (outgoing packet) with simple content - Stream rule";)
alert tcp any any -> any 443 (flow: to_server; content:"abc"; sid:160401; msg:"Allow TLS error handling (outgoing packet) - stream rule";)
alert tcp any any -> any 443 (content:"abc"; sid:160402; msg:"Allow TLS error handling (outgoing packet) - stream rule";)

So, an otherwise pkt rule with a content check will become a stream rule, from my tests.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I will make sure to add examples like these to the doc.

Comment on lines +147 to +150
Although both are related to matching on application layer protocols, since
Suricata 7, a Protocol Detection rule (that uses the ``app-layer-protocol``
keyword) is not internally classified the same as a simple rule matching on
the application layer protocol on the `protocol` field.

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

These two rule types are similar but work differently - could you provide any guidance as to when you might prefer to use one over the other? What are they intended to be used for?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I can be very wrong, but, considering this documentation and the one for the app-layer-protocol keyword, what comes to my mind is...
app-layer-protocol is better suited for situations such as checking for protocol detection failure, protocol negation, or even app-proto changes.
On the other hand, it is only inspected once per direction, when protocol detection is done, so it is unsuitable for per-packet checks, in which using the protocol field of the rule would probably work best.

I'll confirm if this explanation is correct, and try to add something about that in the docs, too

Comment on lines +134 to +137
.. note::
(*) IP Only signatures with negated addresses are `like` IP-only signatures, but
currently handled differently due to limitations of the algorithm processing
IP Only rules.

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

What is the significance of this to me as a rule writer? Are these two types functionally different, or is the only difference just an internal optimization?

If you use the idea above for a section per rule type, special notices like this could go into those sections which might help it flow naturally with the other information you are providing per rule type.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

What is the significance of this to me as a rule writer? Are these two types functionally different, or is the only difference just an internal optimization?

The next version elaborates more on differences that exist and impact analyzers and possibly rule writers, those were missing in this table.

If you use the idea above for a section per rule type, special notices like this could go into those sections which might help it flow naturally with the other information you are providing per rule type.

As you pointed out, considering we already have a section per rule type for the examples, it does make sense to add the notes and a brief description for each type under such. Will improve that, thanks :)

Comment on lines +67 to +71
The ``SignatureSetType()`` overall flow is described below:

.. image:: intro/OverallAlgoHorizontal-20241127.png
:width: 600
:alt: A flowchart representing the SignatureSetType function.

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Overall I think this new page does a good job of outlining what each rule type does when applied to traffic, but for rule writers the other piece of relevant background is "what influences a signature be a particular type" and to me that part still isn't as clear. I think the flow chart is great and helps with this but the branch conditions like "Is PDOnly" and "Is IpOnly" don't have enough supporting detail to understand when they will apply.

I think it would help to include a short description for each rule type stating at a high level what the types are used for, for example

The IPONLY rule type is used for rules that match only on source & destination IPs and port numbers, and not on any other flow or content modifier.

It probably wouldn't fit in the table so it would likely need to go somewhere else, like in a short section introducing each type. Maybe could combine that with the examples so that each type would have a small section that combines both an explainer and the examples for that type.

Comment on lines +22 to +23
.. list-table:: Suricata Rule Types, and their Engine Analysis Term
:header-rows: 1

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Have you considered merging the two rule type tables into one table?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I know it's not ideal as is, but I wanted to avoid having even more columns, as some of the cels in the other column are quite busy with text and Read the Docs doesn't offer a lot of flexibility for adjusting column width...

^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

Aside from the scope of action of a signature, certain rule conditions will
require that it matches against a *real packet* (as opposed to a *pseudo packet*).

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is it worth saying what the difference is here (or link to it, if it's explained elsewhere in the docs)?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Fair. tried something.

Comment on lines +198 to +200
* - ``flow``
- ``to_server``, ``to_client``
- no type change

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

You can correct me if I'm wrong, but I believe there is at least one example where this does change - I think a rule that would otherwise be "iponly" type does change to something else when a flow:to_server or flow:to_client is added.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

You're absolutely right, what would be an ip-only or [like] ip-only rule will become a packet rule if a flow keyword is used.

Comment on lines +199 to +203
- ``to_server``, ``to_client``
- no type change
* - ``flow``
- ``established``, ``not_established``
- to `packet`

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Are there cases where a "packet" type rule changes to a stream-type rule when a content keyword is added? What type is a rule that contains both content and flow:to_server?

Comment on lines +127 to +132
* - Application Layer Protocol Transactions
- Flow
- Per transaction update
- On buffer keywords
- Application layer protocol-related, e.g. ``http.host``, ``rfb.secresult``,
``dcerpc.stub_data``, ``frame`` keywords

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

What exactly is a transaction is a topic that would be very relevant here, for understanding how these rules will be matched against traffic. You have a good explanation of them here...I understand that the developer guide is aimed at a different audience than this page (aimed at rule writers), but they would both benefit from understanding that.

Copy link
Contributor Author

@jufajardini jufajardini Jan 13, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

For now, do you think that it helps if we add a reference to that explanation here, while adding a ticket to have an explanation about transactions from a user-centered perspective?

@njlavigne
Copy link

@njlavigne There will be a new version of this coming out soon to take into account a review done with Victor, but do feel free to add feedback, please, if you see anything weird or missing :)

Thanks for tagging me, please feel free to keep doing it for changes you think I'd be interested in!

@jufajardini
Copy link
Contributor Author

Finally a new version for this work, trying to address comments here: #12411

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
typo/doc update No code change : only doc or typo fixes
Development

Successfully merging this pull request may close these issues.

4 participants